DriveLMM-o1 is a fine-tuned large multimodal model optimized for autonomous driving, based on the InternVL2.5-8B architecture and adapted using LoRA technology, achieving step-by-step reasoning through stitched multi-view images.
Multimodal Fusion
Transformers English